Abstract: In this paper, we introduce the Causal Cognitive 
Interference Channel With Delay (CC-IFC-WD) in which the 
cognitive user transmission can depend on L future received 
symbols as well as the past ones. Taking the effect of the link 
delays into account, CC-IFC-WD fills the gap between the genie- 
aided and causal cognitive radio channels. We study three special 
cases: 1) Classical CC-IFC (L = 0), 2) CC-IFC without delay 
(L = 1) and 3) CC-IFC with a block length delay (L = n). In 
each case, we obtain an inner bound on the capacity region. Our 
coding schemes make use of cooperative strategy by generalized 
block Markov superposition coding, collaborative strategy by rate 
splitting, and Gel'fand-Pinsker coding in order to pre-cancel part 
of the interference. Moreover, instantaneous relaying and non- 
causal partial Decode-and-Forward strategies are employed in the 
second and third cases, respectively. Under special conditions, the 
derived regions reduce to several previously known results. 
Moreover, we show that the coding strategy used to derive the 
achievable rate region for the classical CC-IFC achieves 
capacity for a special case of this channel. Furthermore, we 
extend our achievable rate regions to the Gaussian case. Using 
a numerical example for the Gaussian CC-IFC-WD, we investigate 
the rate gain of the cognitive link for different delay values. 

I. Introduction 

Cognitive Interference Channel (C-IFC) refers to a two-user 
interference channel in which the cognitive user (secondary user) 
has the ability to obtain the message being transmitted by the other 
user (primary user), either in a non-causal or causal manner. The 
C-IFC was first introduced in [1], where for the non-causal C-IFC 
an achievable rate region is derived by combining Gel'fand-Pinsker 
(GP) binning [2] and a well-known simultaneous superposition 
coding scheme (rate splitting) applied to the Interference 
Channel (IFC) [3]. For the non-causal C-IFC, where the cognitive 
user has non-causal full or partial knowledge of the other user's 
transmitted message, several achievable rate regions and capacity 
results in some special cases have been established [4]-[7]. 

In the Causal C-IFC (CC-IFC), the cognitive user can exploit 
knowledge of the primary user's message from the causally re- 
ceived signals (information overheard by the feedback link from 
the channel and not sent back from the receivers). Due to the 
complex nature of the problem, the CC-IFC, which is a more realistic 
and appropriate model for practical applications than the 
non-causal C-IFC, has been far less investigated than the 
latter [8]. In [1], achievable rate regions for the CC-IFC that 
consist of non-cooperative causal transmission protocols have 
been characterized. An improved rate region for the CC-IFC 
employing a cooperative coding strategy based on block Markov 
superposition coding (full Decode-and-Forward (DF) [9]) and GP 
coding was derived in [10]. A more general model in which 
both transmitters are causally cognitive was proposed in [11], 
called the Interference Channel with Generalized Feedback (IFC-GF). 
Different achievable rate regions for the IFC-GF were obtained in 
[11]-[13] by combining the methods of rate splitting, block Markov 
superposition coding, and GP binning. 

In this paper, we define the Causal Cognitive Interference 
Channel With Delay (CC-IFC-WD) as an IFC where one of the 
transmitters can causally overhear the channel and its transmission 
can depend on the L future received symbols as well as the past 
ones. This can equivalently be seen as the classical CC-IFC with 
$-L$ unit delay on the cognitive user's received signal (or on the 
link between the transmitters). This channel model fits wireless 
networks in which the transmitters are close to each other. Moreover, 
the CC-IFC-WD is a middle point between the unrealistic genie-aided 
(non-causal) C-IFC and the complex CC-IFC. In fact, a simple strategy 
such as Instantaneous Relaying (IR) could be beneficial by itself, as 
is the case in the Relay With Delay (RWD) channel [14]. Different upper 
and lower bounds and some capacity results have been derived for 
the RWD in [14]-[16], where the lower bounds are achieved by 
combining cooperative strategies such as full or partial 
DF, IR (for $L > 0$), and non-causal DF (for $L = n$). It has 
been shown that the capacity of the RWD is strictly larger than 
that of the classic relay channel [14]. 

In this paper, after introducing the general CC-IFC-WD, we 
focus on three special cases: 1) $L = 0$, which corresponds to the 
classical CC-IFC, 2) CC-IFC without delay ($L = 1$), where the current 
received symbol (at the cognitive user) can also be utilized, and 
3) CC-IFC with a block length delay ($L = n$), in which the cognitive 
user knows its entire received sequence non-causally. In each 
case, we obtain a new inner bound on the capacity region 
(achievable rate region) for the general discrete memoryless case. Our 
coding schemes make use of a cooperative strategy by generalized 
block Markov coding (partial DF [9]) and superposition coding, 
a collaborative strategy by rate splitting, and GP coding in order to 
mitigate part of the interference. For the first case (classical 
CC-IFC), we use a different strategy from the previous results: 
we use partial DF instead of full DF. Therefore, our achievable 
region improves on that of [10]. Moreover, since the common message 
must be decoded at both receivers, binning against the common 
message provides no improvement. Therefore, we use GP binning 
to pre-cancel part of the private message. A similar conclusion was 
reached in [17] for the Cognitive Z-IFC. In the second and 
third cases, besides the approach we adopt for the first case, IR and 
non-causal partial DF strategies are employed, respectively. The 
derived achievable rate regions, under special conditions, reduce 
to several previously known rate regions, such as the ones in 
[3], [14]. Moreover, we derive the capacity region for a special 
case of the CC-IFC-WD, where achievability follows from our 
derived region. Furthermore, we consider the Gaussian CC-IFC-WD 
and extend the achievable rate regions for $L = 0$, $L = 1$ and 
$L = n$ to the Gaussian case. Using a numerical example 
for the Gaussian CC-IFC-WD, we investigate the rate gain of the 
cognitive link for different delay values. We thereby compare the 
strategies used in our coding schemes and show that 
IR and non-causal DF improve the rate region noticeably. 

The rest of the paper is organized as follows. Section II 
introduces the general CC-IFC-WD channel model and the notation. 
In Section III, we consider three different scenarios and derive a new 
inner bound on the capacity region for each scenario. The capacity 
region for a special case of the CC-IFC-WD is derived in Section 
IV. In Section V, the Gaussian CC-IFC-WD is investigated. 

II. Channel Models and Preliminaries 

Throughout the paper, upper case letters (e.g. $X$) are used to 
denote random variables (RVs) and lower case letters (e.g. $x$) 
denote their realizations. The probability mass function (p.m.f) of 
a random variable (RV) $X$ with alphabet set $\mathcal{X}$ is denoted by 
$p_X(x)$, where occasionally the subscript $X$ is omitted. $|\mathcal{X}|$ denotes 
the cardinality of a finite discrete set $\mathcal{X}$. $A_\epsilon^n(X, Y)$ specifies the 
set of $\epsilon$-strongly, jointly typical sequences of length $n$, abbreviated 
by $A_\epsilon^n$ if it is clear from the context. The notation $X_i^j$ indicates a 
sequence of RVs $(X_i, X_{i+1}, \ldots, X_j)$, where we use $X^j$ instead of $X_1^j$ for brevity. 

Consider the CC-IFC-WD in Fig. 1, which is denoted by 
$(\mathcal{X}_1 \times \mathcal{X}_2, p(y_2, y_3, y_4|x_1, x_2), \mathcal{Y}_2 \times \mathcal{Y}_3 \times \mathcal{Y}_4)$, 
where $X_1 \in \mathcal{X}_1$ and $X_2 \in \mathcal{X}_2$ are the inputs of Transmitter 1 (Tx1) 
and Transmitter 2 (Tx2), respectively, $Y_2 \in \mathcal{Y}_2$ is the secondary user output, 
$Y_3 \in \mathcal{Y}_3$ and $Y_4 \in \mathcal{Y}_4$ are the channel outputs at Receiver 1 
(Rx1) and Receiver 2 (Rx2), respectively, and $p(y_2, y_3, y_4|x_1, x_2)$ is 
the channel transition probability distribution. In $n$ channel uses, 
each Tx$u$ sends a message $m_u$ to Rx$u$, where $u \in \{1, 2\}$. 

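Definition 1: A $(2^{nR_1}, 2^{nR_2}, n)$ code for the CC-IFC-WD 
consists of (i) two message sets $\mathcal{M}_1 = \{1, \ldots, 2^{nR_1}\}$ and 
$\mathcal{M}_2 = \{1, \ldots, 2^{nR_2}\}$ for the primary and secondary users, respectively, 
(ii) an encoding function at the primary user $f_1 : \mathcal{M}_1 \mapsto \mathcal{X}_1^n$, 
(iii) a set of encoding functions at the secondary user 
$x_{2,i} = f_{2,i}(m_2, y_2^{i-1+L})$ for $1 \le i \le n$ and $m_2 \in \mathcal{M}_2$, (iv) two 
decoding functions at Rx1 and Rx2, $g_1 : \mathcal{Y}_3^n \mapsto \mathcal{M}_1$ and 
$g_2 : \mathcal{Y}_4^n \mapsto \mathcal{M}_2$. We assume that the channel is memoryless. 
Thus, for $m_1 \in \mathcal{M}_1$ and $m_2 \in \mathcal{M}_2$, the joint p.m.f of 
$\mathcal{M}_1 \times \mathcal{M}_2 \times \mathcal{X}_1^n \times \mathcal{X}_2^n \times \mathcal{Y}_2^n \times \mathcal{Y}_3^n \times \mathcal{Y}_4^n$ is given by 

\begin{align}
p(m_1, m_2, x_1^n, x_2^n, y_2^n, y_3^n, y_4^n) = {} & p(m_1) p(m_2) \prod_{i=1}^{n} p(x_{1,i}|m_1) \nonumber \\
& \times p(x_{2,i}|m_2, y_2^{i-1+L}) \, p(y_{2,i}|x_{1,i}) \, p(y_{3,i}, y_{4,i}|x_{1,i}, x_{2,i}) \tag{1}
\end{align}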

where we avoid instantaneous feedback from $X_2$ to $Y_2$, which 
the delay may cause. The probability of error for this code is defined 
as $P_e = \max\{P_{e,1}, P_{e,2}\}$, where for $u \in \{1, 2\}$ we have: 

$$P_{e,u} = \frac{1}{2^{n(R_1+R_2)}} \sum_{m_1, m_2} P\big(g_u(Y_{u+2}^n) \neq m_u \,\big|\, (m_1, m_2) \text{ sent}\big)$$



Definition 2: A rate pair $(R_1, R_2)$ is achievable if there exists 
a sequence of $(2^{nR_1}, 2^{nR_2}, n)$ codes with $P_e \to 0$ as $n \to \infty$. 
The capacity region $\mathcal{C}_L$ is the closure of the set of all achievable rate pairs. 
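
The role of the delay parameter in Definition 1 can be made concrete with a small helper. The following is only an illustrative sketch: the function name `observable_symbols` is hypothetical, indices are 1-based as in the text, and the window is clipped to the block length $n$ (an assumption made here so that $L = n$ means "the whole received block").

```python
def observable_symbols(i: int, L: int, n: int) -> range:
    """Indices j of the received symbols y_{2,j} that the cognitive encoder
    may use at time i, following x_{2,i} = f_{2,i}(m_2, y_2^{i-1+L}).

    Assumptions of this sketch: 1-based time indices, L >= 0, and the
    window is clipped to the block length n.
    """
    if not 1 <= i <= n:
        raise ValueError("time index i must satisfy 1 <= i <= n")
    if L < 0:
        raise ValueError("this sketch only covers L >= 0")
    last = min(i - 1 + L, n)
    return range(1, last + 1)


# L = 0: strictly causal, L = 1: current symbol included, L = n: whole block.
print(list(observable_symbols(3, 0, 5)))
print(list(observable_symbols(3, 1, 5)))
print(list(observable_symbols(3, 5, 5)))
```

With $L = 0$ the encoder at time $i = 3$ sees only $y_{2,1}, y_{2,2}$; with $L = 1$ it also sees $y_{2,3}$; with $L = n$ it sees the entire block.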
Fig. 1. Causal Cognitive Interference Channel With Delay (CC-IFC-WD) 



III. Discrete Memoryless CC-IFC-WD 

In this section, we consider the discrete memoryless CC-IFC- 
WD and concentrate on three special cases: 1) Classical CC-IFC 
(L = 0), 2) CC-IFC without delay (L = 1) where current received 
symbol (at the cognitive user) can be utilized too and 3) CC-IFC 
with a block length delay (L = n), in which cognitive user knows 
its entire received sequence non-causally. For all setups, new inner 
bounds on the capacity region are derived. We utilize a coding 
scheme which is based on combining generalized block Markov 
superposition coding, rate splitting and GP binning for part of the 
interference. Furthermore, we apply IR in the second setup and 
non-causal partial DF in the last case. Outlines of the proofs 
are presented. 

A. Classical CC-IFC (L = 0) 

We present a new achievable rate region for this setup. 
Consider auxiliary RVs $T_c, T_p, U_{1c}, U_{1p}, V_{1c}, V_{1p}, U_{2c}, U_{2p}$ and 
a time sharing RV $Q$ defined on arbitrary finite sets 
$\mathcal{T}_c, \mathcal{T}_p, \mathcal{U}_{1c}, \mathcal{U}_{1p}, \mathcal{V}_{1c}, \mathcal{V}_{1p}, \mathcal{U}_{2c}, \mathcal{U}_{2p}$ and $\mathcal{Q}$, respectively. Let 
$Z_1 = (Q, T_c, T_p, U_{1c}, U_{1p}, V_{1c}, V_{1p}, U_{2c}, U_{2p}, X_1, X_2, Y_2, Y_3, Y_4)$, 
and let $\mathcal{P}_1$ denote the set of all joint p.m.fs $p(.)$ on $Z_1$ that can 
be factored in the form of 

\begin{align}
p(z_1) = {} & p(q) p(t_c|q) p(t_p|t_c, q) p(u_{1c}|t_c, q) p(u_{1p}|u_{1c}, t_p, t_c, q) \nonumber \\
& \times p(v_{1c}|t_c, q) p(v_{1p}|v_{1c}, t_p, t_c, q) p(x_1|v_{1p}, v_{1c}, u_{1p}, u_{1c}, t_p, t_c, q) \nonumber \\
& \times p(u_{2c}, u_{2p}|t_p, t_c, q) p(x_2|u_{2c}, u_{2p}, t_p, t_c, q) p(y_2, y_3, y_4|x_1, x_2) \tag{2}
\end{align}

Let $\mathcal{R}_1(Z_1)$ denote the set of all nonnegative rate pairs $(R_1, R_2)$, 
where $R_1 = R_{10d} + R_{10n} + R_{11d} + R_{11n}$ and $R_2 = R_{20} + R_{22}$, 
such that there exist nonnegative $(L_{20}, L_{22})$ satisfying: 





\begin{align}
L_{20} &\ge I(U_{2c}; T_p | T_c Q) = I_1 \tag{3} \\
L_{22} &\ge I(U_{2p}; T_p | T_c Q) = I_2 \tag{4} \\
L_{20} + L_{22} &\ge I(U_{2c}; U_{2p} | T_c Q) + I(U_{2c} U_{2p}; T_p | T_c Q) = I_3 \tag{5} \\
R_{11n} &\le I(V_{1p}; Y_3 | U_{2c} V_{1c} U_{1p} U_{1c} T_p T_c Q) = I_4 \tag{6} \\
R_{10d} + R_{10n} + R_{11d} + R_{11n} + L_{20} + R_{20} &\le I(U_{2c} V_{1p} V_{1c} U_{1p} U_{1c} T_p T_c; Y_3 | Q) = I_5 \tag{7} \\
R_{10n} + R_{11d} + R_{11n} &\le I(V_{1p} V_{1c} U_{1p} T_p; Y_3 U_{2c} | U_{1c} T_c Q) = I_6 \tag{8} \\
R_{11d} + R_{11n} &\le I(V_{1p} U_{1p} T_p; Y_3 U_{2c} | V_{1c} U_{1c} T_c Q) = I_7 \tag{9} \\
R_{11d} + R_{11n} + L_{20} + R_{20} &\le I(U_{2c} V_{1p} U_{1p} T_p; Y_3 | V_{1c} U_{1c} T_c Q) = I_8 \tag{10} \\
R_{10n} + R_{11n} &\le I(V_{1c} V_{1p}; Y_3 | U_{2c} U_{1p} U_{1c} T_p T_c Q) = I_9 \tag{11} \\
R_{11n} + L_{20} + R_{20} &\le I(V_{1p} U_{2c}; Y_3 | V_{1c} U_{1p} U_{1c} T_p T_c Q) = I_{10} \tag{12} \\
R_{10n} + R_{11d} + R_{11n} + L_{20} + R_{20} &\le I(U_{2c} V_{1p} V_{1c} U_{1p} T_p; Y_3 | U_{1c} T_c Q) = I_{11} \tag{13} \\
R_{10n} + R_{11n} + L_{20} + R_{20} &\le I(U_{2c} V_{1p} V_{1c}; Y_3 | U_{1p} U_{1c} T_p T_c Q) = I_{12} \tag{14} \\
L_{20} + R_{20} &\le I(U_{2c}; Y_4 U_{2p} | V_{1c} U_{1c} T_c Q) = I_{13} \tag{15} \\
L_{22} + R_{22} &\le I(U_{2p}; Y_4 U_{2c} | V_{1c} U_{1c} T_c Q) = I_{14} \tag{16} \\
R_{10d} + R_{10n} + L_{20} + R_{20} + L_{22} + R_{22} &\le I(U_{2c} U_{2p} V_{1c} U_{1c} T_c; Y_4 | Q) = I_{15} \tag{17} \\
R_{10n} + L_{20} + R_{20} &\le I(U_{2c} V_{1c}; Y_4 U_{2p} | U_{1c} T_c Q) = I_{16} \tag{18} \\
R_{10n} + L_{22} + R_{22} &\le I(U_{2p} V_{1c}; Y_4 U_{2c} | U_{1c} T_c Q) = I_{17} \tag{19} \\
R_{10n} + L_{20} + R_{20} + L_{22} + R_{22} &\le I(U_{2c} U_{2p} V_{1c}; Y_4 | U_{1c} T_c Q) = I_{18} \tag{20} \\
L_{20} + R_{20} + L_{22} + R_{22} &\le I(U_{2c} U_{2p}; Y_4 | V_{1c} U_{1c} T_c Q) = I_{19} \tag{21} \\
R_{11d} &\le I(U_{1p}; Y_2 | U_{2c} U_{2p} U_{1c} T_p T_c Q) = I_{20} \tag{22} \\
R_{10d} + R_{11d} &\le I(U_{1c} U_{1p}; Y_2 | U_{2c} U_{2p} T_p T_c Q) = I_{21} \tag{23}
\end{align}

Theorem 1: For any $p(.) \in \mathcal{P}_1$ the region $\mathcal{R}_1(Z_1)$ is an 
achievable rate region for the discrete memoryless classical 
CC-IFC (CC-IFC-WD with $L = 0$), i.e., $\bigcup_{Z_1 \in \mathcal{P}_1} \mathcal{R}_1(Z_1) \subseteq \mathcal{C}_0$. 

Remark 1: Consider the case where the cognitive user cannot 
overhear the channel, i.e. $Y_2 = \emptyset$. If we set $T_c = T_p = U_{1c} = U_{1p} = \emptyset$, 
$L_{20} = L_{22} = R_{10d} = R_{11d} = 0$ and $V_{1p} = X_1$, the 
rate region reduces to the Han-Kobayashi (HK) region [3]. 

Remark 2: If we omit receiver 2, i.e. $Y_4 = \emptyset$, and the cognitive 
user does not have any message to transmit, i.e. $R_2 = 0$, the model 
reduces to the relay channel. By setting $T_c = T_p = U_{1p} = V_{1p} = U_{2p} = \emptyset$, 
$L_{20} = L_{22} = R_{11n} = R_{11d} = R_2 = 0$ and $U_{2c} = X_2$, 
the rate region reduces to the partial DF rate for the relay 
channel [9], which includes the capacity regions of the degraded 
[9] and semi-deterministic relay channels [18]. 

Outline of the Proof: We propose the following random 
coding scheme, which contains regular generalized block Markov 
superposition coding, rate splitting and GP coding in the encoding 
part. For decoding at the receivers we utilize backward decoding. 
Messages of the primary and cognitive users are split into four and 
two parts, respectively, i.e. $m_1 = (m_{10d}, m_{10n}, m_{11d}, m_{11n})$ 
and $m_2 = (m_{20}, m_{22})$, where the subscript $d$ (or $n$) refers to the part 
of the primary user's message which can (or cannot) be decoded 
by the cognitive user. Moreover, $(m_{10}, m_{20})$ and $(m_{11}, m_{22})$ are 
the common and private messages as in the HK scheme [3]. $m_{10d}$ 
must be decoded at Rx2 (besides its intended receiver); therefore, 
binning against $m_{10d}$ at the cognitive transmitter provides 
no improvement. Hence, the cognitive user sends $m_{10d}$ cooperatively 
with the primary user, while it uses GP binning against 
$m_{11d}$ to mitigate the effect of this known interference at Rx2. 
Now, consider a block Markov encoding scheme with $B$ blocks 
of transmission, each of $n$ symbols. 
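
The rate splitting described above amounts to indexing one message by several sub-messages. A minimal sketch of such a split is given below, assuming zero-based indices (the text uses one-based index sets) and hypothetical helper names `split_message` and `merge_message`; the sub-message set sizes play the role of $2^{nR_{10d}}, 2^{nR_{10n}}, 2^{nR_{11d}}, 2^{nR_{11n}}$.

```python
from typing import List, Sequence


def split_message(m: int, sizes: Sequence[int]) -> List[int]:
    """Split a message index m in [0, prod(sizes)) into one sub-index per
    part, using a mixed-radix representation (first part most significant)."""
    total = 1
    for s in sizes:
        total *= s
    if not 0 <= m < total:
        raise ValueError("message index out of range")
    parts = []
    for s in reversed(sizes):
        m, r = divmod(m, s)
        parts.append(r)
    return parts[::-1]


def merge_message(parts: Sequence[int], sizes: Sequence[int]) -> int:
    """Inverse of split_message."""
    m = 0
    for p, s in zip(parts, sizes):
        m = m * s + p
    return m


# Hypothetical sizes for (m_10d, m_10n, m_11d, m_11n): 4 * 2 * 8 * 2 = 128 messages.
sizes = [4, 2, 8, 2]
parts = split_message(127, sizes)
print(parts, merge_message(parts, sizes))
```

Because the product of the part sizes equals the size of the message set, the rates add up as $R_1 = R_{10d} + R_{10n} + R_{11d} + R_{11n}$.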

Codebook Generation: Let $q^n$ be a random sequence generated according 
to $\prod_{i=1}^{n} p(q_i)$. Generate $2^{nR_{10d}}$ independent identically distributed 
(i.i.d) $t_c^n$ sequences, each with probability $\prod_{i=1}^{n} p(t_{c,i}|q_i)$. Index 
them as $t_c^n(m'_{10d})$ where $m'_{10d} \in [1, 2^{nR_{10d}}]$. For each 
$t_c^n(m'_{10d})$, generate $2^{nR_{11d}}$ i.i.d $t_p^n$ sequences, $2^{nR_{10d}}$ i.i.d 
$u_{1c}^n$ sequences and $2^{nR_{10n}}$ i.i.d $v_{1c}^n$ sequences, according to 
$\prod_{i=1}^{n} p(t_{p,i}|t_{c,i}, q_i)$, $\prod_{i=1}^{n} p(u_{1c,i}|t_{c,i}, q_i)$ and $\prod_{i=1}^{n} p(v_{1c,i}|t_{c,i}, q_i)$, 
respectively. Index them as $t_p^n(m'_{11d}, m'_{10d})$, $u_{1c}^n(m_{10d}, m'_{10d})$ 
and $v_{1c}^n(m_{10n}, m'_{10d})$ where $m'_{11d} \in [1, 2^{nR_{11d}}]$, 
$m_{10d} \in [1, 2^{nR_{10d}}]$ and $m_{10n} \in [1, 2^{nR_{10n}}]$. For 
each $(u_{1c}^n(m_{10d}, m'_{10d}), t_p^n(m'_{11d}, m'_{10d}), t_c^n(m'_{10d}))$, 
generate $2^{nR_{11d}}$ i.i.d $u_{1p}^n$ sequences, according to 
$\prod_{i=1}^{n} p(u_{1p,i}|u_{1c,i}, t_{p,i}, t_{c,i}, q_i)$. Index them as 
$u_{1p}^n(m_{11d}, m_{10d}, m'_{11d}, m'_{10d})$ where $m_{11d} \in [1, 2^{nR_{11d}}]$. 
For each $(v_{1c}^n(m_{10n}, m'_{10d}), t_p^n(m'_{11d}, m'_{10d}), t_c^n(m'_{10d}))$, generate $2^{nR_{11n}}$ i.i.d 
$v_{1p}^n$ sequences, according to $\prod_{i=1}^{n} p(v_{1p,i}|v_{1c,i}, t_{p,i}, t_{c,i}, q_i)$. 
Index them as $v_{1p}^n(m_{11n}, m_{10n}, m'_{11d}, m'_{10d})$ where 
$m_{11n} \in [1, 2^{nR_{11n}}]$. From the p.m.f in (2), compute the marginals 
$p(u_{2c}|t_c, q)$ and $p(u_{2p}|t_c, q)$ (drop the dependence on $t_p$). 
For each $t_c^n(m'_{10d})$, generate $2^{n(R_{20}+L_{20})}$ i.i.d $u_{2c}^n$ sequences, 
each with probability $\prod_{i=1}^{n} p(u_{2c,i}|t_{c,i}, q_i)$. Index them as 
$u_{2c}^n([m_{20}, l_{20}], m'_{10d})$, where $m_{20} \in [1, 2^{nR_{20}}]$ and 
$l_{20} \in [1, 2^{nL_{20}}]$. For each $t_c^n(m'_{10d})$, generate $2^{n(R_{22}+L_{22})}$ i.i.d $u_{2p}^n$ 
sequences, according to $\prod_{i=1}^{n} p(u_{2p,i}|t_{c,i}, q_i)$. Index them as 
$u_{2p}^n([m_{22}, l_{22}], m'_{10d})$ with $m_{22} \in [1, 2^{nR_{22}}]$, $l_{22} \in [1, 2^{nL_{22}}]$. 
Encoding (at the beginning of block $b$): 

Primary User: In order to transmit the message 
$m_{1,b} = (m_{10d,b}, m_{10n,b}, m_{11d,b}, m_{11n,b})$, Tx1 picks the codewords 
$v_{1p}^n(m_{11n,b}, m_{10n,b}, m_{11d,b-1}, m_{10d,b-1})$, $v_{1c}^n(m_{10n,b}, m_{10d,b-1})$, 
$u_{1p}^n(m_{11d,b}, m_{10d,b}, m_{11d,b-1}, m_{10d,b-1})$, $u_{1c}^n(m_{10d,b}, m_{10d,b-1})$, 
$t_p^n(m_{11d,b-1}, m_{10d,b-1})$ and $t_c^n(m_{10d,b-1})$. Then, it sends $x_1^n$ generated 
according to $\prod_{i=1}^{n} p(x_{1,i}|v_{1p,i}, v_{1c,i}, u_{1p,i}, u_{1c,i}, t_{p,i}, t_{c,i}, q_i)$. In 
the first block, the cooperative information is $(m_{11d,b-1}, m_{10d,b-1}) = 
(m_{11d,0}, m_{10d,0}) = (0, 0)$, and in the last block, a previously 
known message $(m_{11d,B}, m_{10d,B}) = (1, 1)$ is transmitted. 
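
The block Markov structure of the primary encoder can be summarized as a schedule that pairs, in each block, the fresh decodable message parts with those of the previous block. The sketch below is illustrative only: the names `block_markov_schedule` and `rate_loss_factor` are hypothetical, and the example message pairs are made up. It uses the dummy pair $(0, 0)$ before the first block and the known pair $(1, 1)$ in the last block, as described above.

```python
from typing import List, Tuple

Pair = Tuple[int, int]  # (m_11d, m_10d) for one block


def block_markov_schedule(fresh: List[Pair]) -> List[Tuple[Pair, Pair]]:
    """Return, for blocks b = 1..B, the (current, previous) decodable pairs.

    `fresh` holds the B-1 information-bearing pairs of blocks 1..B-1.
    Block 1 uses the dummy previous pair (0, 0); block B carries the
    previously known pair (1, 1), so it conveys no new decodable part.
    """
    current = list(fresh) + [(1, 1)]
    previous = [(0, 0)] + list(fresh)
    return list(zip(current, previous))


def rate_loss_factor(B: int) -> float:
    """Fraction of blocks in which fresh decodable parts are sent."""
    return (B - 1) / B


for b, (cur, prev) in enumerate(block_markov_schedule([(5, 2), (7, 3)]), start=1):
    print(b, cur, prev)
print(rate_loss_factor(4))
```

The factor $(B-1)/B$ tends to 1 as $B$ grows, which is why the known last-block message does not reduce the achievable rates asymptotically.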

Cognitive User: At the beginning of block $b$, Tx2 knows 
$\hat{m}_{10d,b-1}$ and $\hat{m}_{11d,b-1}$, which are estimates of the parts of 
the common and private messages sent by Tx1 in the previous 
block that can be decoded by the cognitive user. In order to 
send $m_{2,b} = (m_{20,b}, m_{22,b})$, encoder 2, knowing the codewords 
$t_p^n(\hat{m}_{11d,b-1}, \hat{m}_{10d,b-1})$ and $t_c^n(\hat{m}_{10d,b-1})$, seeks an index pair 
$(l_{20,b}, l_{22,b})$ such that 

\begin{align}
\big(& u_{2c}^n([m_{20,b}, l_{20,b}], \hat{m}_{10d,b-1}), u_{2p}^n([m_{22,b}, l_{22,b}], \hat{m}_{10d,b-1}), \nonumber \\
& t_p^n(\hat{m}_{11d,b-1}, \hat{m}_{10d,b-1}), t_c^n(\hat{m}_{10d,b-1}), q^n \big) \in A_\epsilon^n \tag{24}
\end{align}

If there is more than one such index pair, pick the smallest. If there 
are no such codewords, declare an error. Such indices 
$l_{20,b}$ and $l_{22,b}$ exist with sufficiently high probability if $n$ is sufficiently 
large and (3)-(5) hold. Then, it sends $x_2^n$ generated according to 
$\prod_{i=1}^{n} p(x_{2,i}|u_{2p,i}, u_{2c,i}, t_{p,i}, t_{c,i}, q_i)$. 

i=l 

Decoding: Cognitive User: At the end of block $b$, Tx2 wants to 
correctly recover $(m_{11d,b}, m_{10d,b})$. Hence, it looks for a unique 
pair $(\hat{m}_{11d,b}, \hat{m}_{10d,b})$ such that 

\begin{align}
\big(& y_2^n(b), u_{1p}^n(\hat{m}_{11d,b}, \hat{m}_{10d,b}, \hat{m}_{11d,b-1}, \hat{m}_{10d,b-1}), \nonumber \\
& u_{1c}^n(\hat{m}_{10d,b}, \hat{m}_{10d,b-1}), u_{2c}^n([m_{20,b}, l_{20,b}], \hat{m}_{10d,b-1}), \nonumber \\
& u_{2p}^n([m_{22,b}, l_{22,b}], \hat{m}_{10d,b-1}), t_p^n(\hat{m}_{11d,b-1}, \hat{m}_{10d,b-1}), \tag{25} \\
& t_c^n(\hat{m}_{10d,b-1}), q^n \big) \in A_\epsilon^n(Y_2, U_{2c}, U_{2p}, U_{1p}, U_{1c}, T_p, T_c, Q) \nonumber
\end{align}

This step can be done with a sufficiently small probability of error 
for sufficiently large $n$ if (22)-(23) hold. 

Backward decoding is used at the receivers; hence, they start 
decoding after all $B$ blocks are received. Rx1: In block $b$, Rx1 looks 
for a unique quadruple $(\hat{m}_{11n,b}, \hat{m}_{10n,b}, \hat{m}_{11d,b-1}, \hat{m}_{10d,b-1})$ 
and some pair $(\hat{m}_{20,b}, \hat{l}_{20,b})$ such that 

\begin{align}
\big(& y_3^n(b), u_{2c}^n([\hat{m}_{20,b}, \hat{l}_{20,b}], \hat{m}_{10d,b-1}), v_{1c}^n(\hat{m}_{10n,b}, \hat{m}_{10d,b-1}), \nonumber \\
& v_{1p}^n(\hat{m}_{11n,b}, \hat{m}_{10n,b}, \hat{m}_{11d,b-1}, \hat{m}_{10d,b-1}), \tag{26} \\
& u_{1p}^n(m_{11d,b}, m_{10d,b}, \hat{m}_{11d,b-1}, \hat{m}_{10d,b-1}), t_c^n(\hat{m}_{10d,b-1}), \nonumber \\
& u_{1c}^n(m_{10d,b}, \hat{m}_{10d,b-1}), t_p^n(\hat{m}_{11d,b-1}, \hat{m}_{10d,b-1}), q^n \big) \in A_\epsilon^n \nonumber
\end{align}

where $(m_{11d,b}, m_{10d,b})$ were decoded in the previous step. Here, 
for large enough $n$, the probability of error can be made 
sufficiently small if (6)-(14) hold. 

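Rx2: In block $b$, Rx2 finds a unique pair $(\hat{m}_{20,b}, \hat{m}_{22,b})$ and 
some quadruple $(\hat{l}_{20,b}, \hat{l}_{22,b}, \hat{m}_{10n,b}, \hat{m}_{10d,b-1})$ such that 

\begin{align}
\big(& u_{2c}^n([\hat{m}_{20,b}, \hat{l}_{20,b}], \hat{m}_{10d,b-1}), u_{2p}^n([\hat{m}_{22,b}, \hat{l}_{22,b}], \hat{m}_{10d,b-1}), \nonumber \\
& v_{1c}^n(\hat{m}_{10n,b}, \hat{m}_{10d,b-1}), u_{1c}^n(m_{10d,b}, \hat{m}_{10d,b-1}), t_c^n(\hat{m}_{10d,b-1}), \nonumber \\
& q^n, y_4^n(b) \big) \in A_\epsilon^n(Y_4, U_{2c}, U_{2p}, V_{1c}, U_{1c}, T_c, Q) \tag{27}
\end{align}

where $m_{10d,b}$ was decoded in the previous step. With arbitrarily 
high probability, no error occurs at Rx2 if $n$ is large enough and 
(15)-(21) hold. ■ 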

Now, to understand the shape of the achievable region, we give 
a compact expression for $\mathcal{R}_1(Z_1)$ which is easier to compute. 

Corollary 1: The region $\mathcal{R}_1(Z_1)$, after Fourier-Motzkin 
elimination, can be expressed as: 

Ri < min(^min(hi + I2 + ha, hi + I12, h) — h, 

h\ + min(I 2 + In - h, h)) 
R2 < min(^hg, I-n + min(h , hz)) — l[ 

Ri + R,2< min^hi + h, hs + min(h, h — h), 

hi + In + min(ho, l'i + I13), 

hi + hi + min(h2, l' 2 + he, ho + In - h), 

l' 2 + min(hi + Ii8,ho + I15)) - I\ 

2Ri +R 2 < min{l'2 + hs + I3, hi + 2J 4 + In + he, 

h + In + rnin(hi + I12, h)\+ hi - l'i 

Ri + 2R 2 < min(^hi + ho + hi + min(Ii4 + he, h$), 
hi + In, + min(ho + ho, h)j - 2l[ 
2Ri + 2R2 < min(^h + min(Ii4 + hi, In + h), 

ho + hi + Is) + hi + hs - 2I[ 

2R X + 3R 2 < hi+ ho + 2/i4 + hi + hs - 3/( 
3Ri + 2R 2 < 2/21+2/4 + hi +17 +/15 - 2I[ 

subject to h < min(I%, ho, I\ 2 , /13, I\§) and I 2 < In, where 
{h, 1 < i < 21} were defined in j|3ll-il231>. and /( = max(Ii + 
h, h), 1*2 = min(I 10 - and I 3 = min(I 6 , J n - h). 
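
Fourier-Motzkin elimination, used in Corollary 1 to remove the auxiliary rates, can be illustrated on a toy system. The following is a minimal sketch and not the computation behind the corollary: the function name `eliminate` is hypothetical, each inequality is stored as a coefficient map with an upper bound, and the numerical values standing in for $I_1$ and $I_{13}$ are made up. It eliminates $L_{20}$ from the pair of constraints (3) and (15).

```python
from fractions import Fraction
from typing import Dict, List, Tuple

# An inequality sum_v coeffs[v] * v <= bound.
Ineq = Tuple[Dict[str, Fraction], Fraction]


def eliminate(ineqs: List[Ineq], var: str) -> List[Ineq]:
    """One Fourier-Motzkin step: project out `var` from a list of inequalities."""
    pos, neg, rest = [], [], []
    for coeffs, bound in ineqs:
        c = Fraction(coeffs.get(var, 0))
        if c > 0:
            pos.append((coeffs, bound))
        elif c < 0:
            neg.append((coeffs, bound))
        else:
            rest.append((coeffs, bound))
    # Pair every upper bound on `var` with every lower bound on `var`.
    for pc, pb in pos:
        for nc, nb in neg:
            a = Fraction(pc[var])    # positive coefficient
            b = -Fraction(nc[var])   # magnitude of the negative coefficient
            combined: Dict[str, Fraction] = {}
            for v in (set(pc) | set(nc)) - {var}:
                val = Fraction(pc.get(v, 0)) / a + Fraction(nc.get(v, 0)) / b
                if val != 0:
                    combined[v] = val
            rest.append((combined, Fraction(pb) / a + Fraction(nb) / b))
    return rest


# Hypothetical values: I_1 = 1/4 and I_13 = 1.
system = [
    ({"L20": Fraction(-1)}, Fraction(-1, 4)),                # L20 >= I_1
    ({"L20": Fraction(1), "R20": Fraction(1)}, Fraction(1)),  # L20 + R20 <= I_13
]
print(eliminate(system, "L20"))
```

The result is the single constraint $R_{20} \le I_{13} - I_1$, which is the kind of difference term that appears throughout the compact region.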



B. CC-IFC without delay (L = 1) 

In this case, the cognitive user can utilize the current received 
symbol as well as the past ones in order to cooperate with the 
primary user or reduce the effect of the interference. In addition to the 
scheme used in Theorem 1, IR is employed to achieve higher 
rates using this additional information. Consider auxiliary RVs 
$T_c, T_p, U_{1c}, U_{1p}, V_{1c}, V_{1p}, U_{2c}, U_{2p}, V_2$ and $Q$ defined on 
arbitrary finite sets $\mathcal{T}_c, \mathcal{T}_p, \mathcal{U}_{1c}, \mathcal{U}_{1p}, \mathcal{V}_{1c}, \mathcal{V}_{1p}, \mathcal{U}_{2c}, \mathcal{U}_{2p}, \mathcal{V}_2$ and $\mathcal{Q}$, 
respectively. Let $Z_2 = (Z_1, V_2)$, and let $\mathcal{P}_2$ be the set of all joint 
p.m.fs $p(.)$ on $Z_2$ that can be factored in the form of 

\begin{align}
p(z_2) = {} & p(q) p(t_c|q) p(t_p|t_c, q) p(u_{1c}|t_c, q) p(u_{1p}|u_{1c}, t_p, t_c, q) \nonumber \\
& \times p(v_{1c}|t_c, q) p(v_{1p}|v_{1c}, t_p, t_c, q) p(x_1|v_{1p}, v_{1c}, u_{1p}, u_{1c}, t_p, t_c, q) \nonumber \\
& \times p(u_{2c}, u_{2p}|t_p, t_c, q) p(v_2|u_{2c}, u_{2p}, t_p, t_c, q) p(x_2|v_2, y_2, q) \tag{28}
\end{align}

In fact, $x_2 = f_2(v_2, y_2, q)$, where $f_2(.)$ is an arbitrary 
deterministic function. Let $\mathcal{R}_2(Z_2)$ be the set of all nonnegative rate 
pairs $(R_1, R_2)$, where $R_1 = R_{10d} + R_{10n} + R_{11d} + R_{11n}$ 
and $R_2 = R_{20} + R_{22}$, such that there exist nonnegative real 
$(L_{20}, L_{22})$ which satisfy (3)-(23). 

Theorem 2: For any $p(.) \in \mathcal{P}_2$ the region $\mathcal{R}_2(Z_2)$ is 
achievable for the discrete memoryless CC-IFC without delay 
(CC-IFC-WD with $L = 1$), i.e., $\bigcup_{Z_2 \in \mathcal{P}_2} \mathcal{R}_2(Z_2) \subseteq \mathcal{C}_1$. 

Proof: The achievability proof follows by combining the 
scheme used in Theorem 1 and IR. Encoding and decoding follow 
the same lines as in Theorem 1, except that during the codebook 
generation at the cognitive user (Tx2), $v_2^n$ is generated according 
to $\prod_{i=1}^{n} p(v_{2,i}|u_{2p,i}, u_{2c,i}, t_{p,i}, t_{c,i}, q_i)$, and in the encoding 
session, Tx2 at time $i$ and upon receiving $y_{2,i}$, sends 
$x_{2,i} = f_{2,i}(v_{2,i}, y_{2,i}, q_i)$. ■ 

Remark 3: If we assume that $V_2$ has an extended alphabet of size 
$|\mathcal{X}_2|^{|\mathcal{Y}_2|}$ (all mappings from $\mathcal{Y}_2$ to $\mathcal{X}_2$), this scheme is analogous to 
Shannon's strategy of cancelling the causally known interference 
[19], [20]. 

Remark 4: This scheme is feasible for any $L \ge 1$. Moreover, 
$f_2(.)$ can be extended to $x_{2,i} = f_{2,i}(v_{2,i}, y_{2,i}, \ldots, y_{2,i+L-1}, q_i)$. 

Remark 5: Setting $T_c = T_p = U_{1p} = V_{1p} = U_{2p} = \emptyset$, 
$R_2 = L_{20} = L_{22} = R_{11n} = R_{11d} = 0$ and $U_{2c} = V_2$, the 
region reduces to the partial DF rate for the RWD [14, Theorem 2.5]. 

C. CC-IFC with a block length delay (L = n) 

In this part, we investigate the CC-IFC with a block length delay 
($L = n$). This means that the cognitive user knows its entire received 
sequence non-causally. We derive an achievable rate region using 
a coding scheme based on combining non-causal partial DF, rate 
splitting and GP binning against part of the interference. Consider 
auxiliary RVs $U_{1c}, U_{1p}, V_{1c}, V_{1p}, U_{2c}, U_{2p}$ and a time sharing RV 
$Q$ defined on arbitrary finite sets $\mathcal{U}_{1c}, \mathcal{U}_{1p}, \mathcal{V}_{1c}, \mathcal{V}_{1p}, \mathcal{U}_{2c}, \mathcal{U}_{2p}$ and 
$\mathcal{Q}$, respectively. Let $Z_3 = (Q, U_{1c}, U_{1p}, V_{1c}, V_{1p}, U_{2c}, U_{2p}, X_1, 
X_2, Y_2, Y_3, Y_4)$, and let $\mathcal{P}_3$ denote the set of all joint p.m.fs $p(.)$ 
on $Z_3$ that can be factored in the form of (2) with $(t_p, t_c) = 
(u_{1p}, u_{1c})$. Let $\mathcal{R}_3(Z_3)$ be the set of all nonnegative rate pairs 
$(R_1, R_2)$, where $R_1 = R_{10d} + R_{10n} + R_{11d} + R_{11n}$ and $R_2 = 
R_{20} + R_{22}$, such that there exist nonnegative real $(L_{20}, L_{22})$ 
which satisfy (3)-(21) with $(T_p, T_c) = (U_{1p}, U_{1c})$ and: 

\begin{align}
R_{11d} &\le I(U_{1p}; Y_2 | U_{1c} Q) \tag{29} \\
R_{10d} + R_{11d} &\le I(U_{1c} U_{1p}; Y_2 | Q) \tag{30}
\end{align}



Theorem 3: For any $p(.) \in \mathcal{P}_3$ the region $\mathcal{R}_3(Z_3)$ is 
achievable for the discrete memoryless CC-IFC with a block length delay 
(CC-IFC-WD with $L = n$), i.e., $\bigcup_{Z_3 \in \mathcal{P}_3} \mathcal{R}_3(Z_3) \subseteq \mathcal{C}_n$. 

Proof: The proof is similar to that of Theorem 1, except that there is 
no dependence on previous block messages. Hence, simultaneous 
joint decoding is used instead of backward decoding. ■ 

IV. Capacity of Degraded classical CC-IFC 

In this section, we investigate the classical CC-IFC (CC-IFC-WD 
with $L = 0$), with joint p.m.f $p^*$ given by (1) with $L = 0$. Using 
the achievable region in Theorem 1, we find the capacity region for 
a special case. We define the degraded classical CC-IFC as a classical 
CC-IFC where the degradedness condition for the Tx1-Rx1 pair with 
the cognitive user as a relay holds for every $p^*$: 

$$p(y_3|x_1, x_2, y_2) = p(y_3|x_2, y_2) \tag{31}$$

i.e., $X_1 \to (X_2, Y_2) \to Y_3$ forms a Markov chain. Next, we 
impose the following strong interference conditions: 

\begin{align}
I(X_1; Y_3) &\le I(X_1; Y_4) \tag{32} \\
I(X_2; Y_4 | X_1) &\le I(X_2; Y_3 | X_1) \tag{33}
\end{align}

In fact, under these conditions the interfering signals at Rx1 and 
Rx2 are strong enough to decode both messages. 

Theorem 4: The capacity region of the degraded classical 
CC-IFC with joint p.m.f $p^*$, satisfying (32) and (33), is given by 

\begin{align}
\mathcal{C}^* = \bigcup_{p(t) p(x_1|t) p(x_2|t)} \Big\{ (R_1, R_2) : {} & R_1 \ge 0, R_2 \ge 0, \nonumber \\
& R_1 \le I(X_1; Y_2 | X_2 T) \tag{34} \\
& R_2 \le I(X_2; Y_4 | X_1 T) \nonumber \\
& R_1 + R_2 \le \min\{ I(X_1 X_2; Y_3), I(X_1 X_2; Y_4) \} \Big\} \nonumber
\end{align}

Proof: Achievability: For this part, we use the region $\mathcal{R}_1$ in 
Theorem 1 (or Corollary 1) and we ignore the time sharing RV $Q$. Let 
$T_p = U_{1p} = V_{1p} = U_{2p} = \emptyset$ and $R_{22} = R_{11n} = R_{11d} = 0$, 
which cross out the private parts of both messages, making the 
messages common to both receivers. Furthermore, assume that 
the cognitive user fully decodes the message of the primary user ($m_1$). 
Hence, set $R_{10n} = 0$ and $V_{1c} = \emptyset$. To omit the GP coding, we set 
$L_{20} = L_{22} = 0$. Redefining $T_c = T$, $U_{2c} = X_2$, $U_{1c} = X_1$ and 
applying condition (33) complete the proof of achievability. 

Converse: Consider a $(2^{nR_1}, 2^{nR_2}, n)$ code with $P_e \to 0$. For 
$1 \le i \le n$, define the RV $T_i = Y_2^{i-1}$. Noting the joint p.m.f $p^*$, 
we remark that $X_1 \to T \to X_2$ forms a Markov chain. First, we 
provide a useful lemma which we need in the proof of the converse. 

Lemma 1: If (32) and (33) hold for all distributions $p^*$, then 

\begin{align}
I(X_1^n; Y_3^n) &\le I(X_1^n; Y_4^n) \tag{35} \\
I(X_2^n; Y_4^n | X_1^n) &\le I(X_2^n; Y_3^n | X_1^n) \tag{36}
\end{align}

Proof: The proof follows the same lines as in [5, Lemma 5]. ■ 
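Noting the independence of the messages and utilizing Fano's 
inequality for the first bound, we have: 

\begin{align}
nR_1 - n\delta_{1n} &\le I(M_1; Y_3^n | M_2) \overset{(a)}{\le} I(M_1; Y_3^n, Y_2^n | M_2) \nonumber \\
&\overset{(b)}{=} \sum_{i=1}^{n} I(M_1; Y_{3,i}, Y_{2,i} | Y_3^{i-1}, Y_2^{i-1}, M_2, X_{2,i}) \nonumber \\
&\overset{(c)}{\le} \sum_{i=1}^{n} I(M_1, M_2, Y_3^{i-1}, X_{1,i}; Y_{3,i}, Y_{2,i} | X_{2,i}, T_i) \nonumber \\
&\overset{(d)}{=} \sum_{i=1}^{n} I(X_{1,i}; Y_{3,i}, Y_{2,i} | X_{2,i}, T_i) \nonumber
\end{align}

where (a) and (c) are due to the fact that mutual information is 
non-negative, (b) follows from the chain rule and the fact that $X_{2,i}$ 
is a deterministic function of $M_2$ and $Y_2^{i-1}$, and (d) holds because 
the channel is memoryless. Using a standard time-sharing argument and 
condition (31), we have 

$$R_1 - \delta_{1n} \le I(X_1; Y_3, Y_2 | X_2, T) = I(X_1; Y_2 | X_2, T)$$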
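
The equality between the two mutual information terms in the bound on $R_1$ can be checked with the chain rule. The short derivation below is a sketch that assumes the degradedness condition (31) continues to hold when additionally conditioning on $T$, which is expected here because $T$ consists of past outputs of a memoryless channel.

```latex
\begin{align*}
I(X_1; Y_3, Y_2 \mid X_2, T)
  &= I(X_1; Y_2 \mid X_2, T) + I(X_1; Y_3 \mid X_2, Y_2, T) \\
  &= I(X_1; Y_2 \mid X_2, T),
\end{align*}
% the second term vanishes because X_1 -> (X_2, Y_2) -> Y_3 is a Markov chain by (31)
```

In words, once $(X_2, Y_2)$ is known, $Y_3$ carries no further information about $X_1$, so the cognitive user's observation $Y_2$ alone limits $R_1$.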

Similarly, applying Fano's inequality, we bound $R_2$ as: 

nR 2 - nS 2n < /(Ma; F 4 "|Mi) < /(Mi; Y^\M X ) 

n 

( => KM 2 ; Y iti , Y^Yt 1 ^- 1 ^^ 

i=l 
(b) « 

< I ' -\h..\j , ; Y 2 , i \Yl- 1 ,T i , Mi, X 1A ) 

8=1 
(C) " 

< ^/(X 2 , J ;K 1 , l |^i,„I 1 

i=l 

where (a) is based on the chain rule, (b) holds since $X_{1,i}$ is a 
deterministic function of $M_1$ and mutual information is non-negative, and 
(c) holds because the channel is memoryless with joint p.m.f $p^*$. 
Now, we utilize Fano's inequality to bound $R_1 + R_2$ as: 

\begin{align}
n(R_1 + R_2) - n\delta_{3n} &\le I(M_1; Y_3^n) + I(M_2; Y_4^n | M_1) \nonumber \\
&\overset{(a)}{\le} I(X_1^n; Y_3^n) + I(M_2; Y_4^n | M_1) \nonumber \\
&\overset{(b)}{\le} I(X_1^n; Y_3^n) + I(M_2, X_2^n; Y_4^n | M_1, X_1^n) \nonumber \\
&\overset{(c)}{\le} I(X_1^n; Y_3^n) + I(X_2^n; Y_4^n | X_1^n) \tag{38} \\
&\overset{(d)}{\le} I(X_1^n; Y_3^n) + I(X_2^n; Y_3^n | X_1^n) \nonumber \\
&= I(X_1^n, X_2^n; Y_3^n) \le \sum_{i=1}^{n} I(X_{1,i}, X_{2,i}; Y_{3,i}) \nonumber
\end{align}

where (a) and (b) follow from the non-negativity of mutual 
information and the deterministic relation between $X_1^n$ and $M_1$, (c) 
follows from the fact that conditioning does not increase entropy 
and the conditional independence between $Y_4^n$ and $(M_1, M_2)$ 
given $(X_1^n, X_2^n)$, and (d) holds due to (36). 

Finally, utilizing Fano's inequality, (38) and condition (35), 
the last bound can be shown. Using a standard time-sharing 
argument for these bounds completes the proof. ■ 

V. Gaussian CC-IFC-WD 

We consider the Gaussian CC-IFC-WD and extend the achievable 
rate regions $\mathcal{R}_1(Z_1)$, $\mathcal{R}_2(Z_2)$ and $\mathcal{R}_3(Z_3)$ derived for the 
discrete memoryless classical CC-IFC ($L = 0$), CC-IFC without 
delay ($L = 1$) and CC-IFC with a block length delay ($L = n$), 
respectively, to the Gaussian case. 

The Gaussian CC-IFC-WD at time $i = 1, \ldots, n$ can be modeled as 

\begin{align}
Y_{2,i} &= h_{21} X_{1,i} + Z_{2,i} \nonumber \\
Y_{3,i} &= h_{31} X_{1,i} + h_{32} X_{2,i} + Z_{3,i} \tag{39} \\
Y_{4,i} &= h_{41} X_{1,i} + h_{42} X_{2,i} + Z_{4,i} \nonumber
\end{align}

where $h_{21}, h_{31}, h_{32}, h_{41}$ and $h_{42}$ are known channel gains. $X_{1,i}$ 
and $X_{2,i}$ are input signals with average power constraints $P_1$ and 
$P_2$, respectively. $Z_{2,i}, Z_{3,i}$ and $Z_{4,i}$ are i.i.d and independent zero 
mean Gaussian noise components with powers $N_2, N_3$ and $N_4$, 
respectively. Note that, at the secondary user we have a set of 
encoding functions $x_{2,i} = f_{2,i}(m_2, y_2^{i-1+L})$ for $1 \le i \le n$ 
and $m_2 \in \mathcal{M}_2$. 
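
The channel model (39), together with the delay-dependent encoder at the secondary user, can be simulated in a few lines. The sketch below is illustrative only: the function name `simulate_gaussian_cc_ifc_wd` and the callback signature `f2(i, visible_y2)` are hypothetical, $L \ge 0$ is assumed, and the power constraints are not enforced.

```python
import math
import random
from typing import Callable, List, Sequence, Tuple


def simulate_gaussian_cc_ifc_wd(
    x1: Sequence[float],
    f2: Callable[[int, Sequence[float]], float],
    L: int,
    gains: Tuple[float, float, float, float, float],  # (h21, h31, h32, h41, h42)
    noise_powers: Tuple[float, float, float],         # (N2, N3, N4)
    rng: random.Random,
) -> Tuple[List[float], List[float], List[float], List[float]]:
    """Simulate one block of (39); returns (x2, y2, y3, y4)."""
    h21, h31, h32, h41, h42 = gains
    n2, n3, n4 = noise_powers
    n = len(x1)
    # Y2 does not depend on X2 in (39), so the whole y2 block can be drawn first.
    y2 = [h21 * a + rng.gauss(0.0, math.sqrt(n2)) for a in x1]
    x2: List[float] = []
    y3: List[float] = []
    y4: List[float] = []
    for i in range(1, n + 1):
        visible = y2[: min(i - 1 + L, n)]  # y_2^{i-1+L}, clipped to the block
        b = f2(i, visible)
        x2.append(b)
        y3.append(h31 * x1[i - 1] + h32 * b + rng.gauss(0.0, math.sqrt(n3)))
        y4.append(h41 * x1[i - 1] + h42 * b + rng.gauss(0.0, math.sqrt(n4)))
    return x2, y2, y3, y4


# Toy encoder that forwards the latest visible symbol (zero if none is visible).
forward = lambda i, vis: vis[-1] if vis else 0.0
out = simulate_gaussian_cc_ifc_wd(
    [1.0, -1.0, 1.0], forward, 1, (1.0, 1.0, 1.0, 1.0, 1.0), (1.0, 1.0, 1.0),
    random.Random(0),
)
print(out[0])
```

With $L = 1$ the toy encoder can forward the current received symbol, whereas with $L = 0$ it can only forward the previous one.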

First, we consider the Gaussian classical CC-IFC ($L = 0$). Using 
standard arguments, the region $\mathcal{R}_1$ in Theorem 1 (or Corollary 1) 
can be extended to the discrete-time Gaussian memoryless case 
with continuous alphabets ($\mathcal{R}_1^*$). Hence, it is sufficient to evaluate 
(3)-(23) with an appropriate choice of input distribution. 
We constrain all the inputs to be Gaussian and set the time 
sharing RV $Q = \emptyset$. For certain $\{0 \le \beta_r \le 1, r \in \{1, 2, 3, 4\}\}$, 
$\{0 \le \beta'_s \le 1, s \in \{1, 2\}\}$ and $\{0 \le \gamma_t \le 1, t \in \{1, 2, 3\}\}$ with 
$\beta'_1 + \beta_1 + \beta'_2 + \beta_2 + \beta_3 + \beta_4 \le 1$ and $\gamma_1 + \gamma_2 + \gamma_3 \le 1$, consider 
the following mapping ($MAP_1$) for the generated codebook in 
Theorem 1 with respect to the p.m.f (2), which contains rate 
splitting, generalized block Markov superposition and GP coding: 

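\begin{align}
& T_c \sim \mathcal{N}(0, \beta_4 P_1), \quad T'_p \sim \mathcal{N}(0, \beta_3 P_1), \quad U'_{1c} \sim \mathcal{N}(0, \beta_2 P_1) \tag{40} \\
& U'_{1p} \sim \mathcal{N}(0, \beta'_2 P_1), \quad V'_{1c} \sim \mathcal{N}(0, \beta_1 P_1), \quad V'_{1p} \sim \mathcal{N}(0, \beta'_1 P_1) \tag{41} \\
& T_p = T'_p + T_c, \quad U_{1c} = U'_{1c} + T_c, \quad V_{1c} = V'_{1c} + T_c \tag{42} \\
& U_{1p} = U'_{1p} + U'_{1c} + T'_p + T_c, \quad V_{1p} = V'_{1p} + V'_{1c} + T'_p + T_c \tag{43} \\
& X_1 = V'_{1p} + V'_{1c} + U'_{1p} + U'_{1c} + T'_p + T_c \tag{44} \\
& U'_{2c} \sim \mathcal{N}(0, \gamma_1 P_2), \quad U'_{2p} \sim \mathcal{N}(0, \gamma_2 P_2) \tag{45}
\end{align}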
Si = h 41 T p , S 2 = h A1 T p + (46) 
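\begin{align}
& U_{2c} = U'_{2c} + \alpha_1 S_1, \quad U_{2p} = U'_{2p} + \alpha_2 S_2 \tag{47} \\
& X_2 = U'_{2p} + U'_{2c} + \sqrt{\frac{\gamma_3 P_2}{\beta_4 P_1}} \, T_c \tag{48}
\end{align}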

where we have oti = A hi ^ lP2 D and a 2 = /i,42 P P2 wherein 
A = N 4 + h 2 M + | x + + fa)Px + h\ 2ll P 2 + 
(^41 V PaP\ + /142V73P2) ■ in fact, optimal values for a\, a 2 , 
$S_1$ and $S_2$ (used for GP coding) can be found by optimizing the 
rate region. However, this method is cumbersome, so we use a 
modified version of Costa's dirty paper coding (DPC) results [21]. 

Next, we investigate the Gaussian CC-IFC without delay ($L = 1$), 
i.e. $X_2 = f_2(m_2, Y_2)$. Considering Theorem 2, $v_{2,i}$ is 
generated according to $p(v_{2,i}|u_{2p,i}, u_{2c,i}, t_{p,i}, t_{c,i}, q_i)$, and $x_{2,i} = 
f_{2,i}(v_{2,i}, y_{2,i}, q_i)$. In order to obtain the Gaussian counterpart of 
$\mathcal{R}_2$, namely $\mathcal{R}_2^*$, the appropriate mapping ($MAP_2$) consists of (40)-(47), 
$V_2 = U'_{2p} + U'_{2c} + \sqrt{\gamma_3 P_2 / (\beta_4 P_1)} \, T_c$ and $X_2 = h(\beta Y_2 + (1 - \beta) V_2)$, 
where $0 \le \beta \le 1$ and $h$ is a normalizing parameter. 
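
The normalizing parameter $h$ in the IR mapping can be computed from second moments. The following is a sketch under stated assumptions, not the authors' evaluation: all signals are zero mean, $X_1$ uses its full power $P_1$, the noise $Z_2$ is independent of $V_2$, and the covariance between $X_1$ and $V_2$ is supplied by the caller. The function name `ir_normalizer` and its parameters are hypothetical.

```python
import math


def ir_normalizer(beta: float, P1: float, P2: float, N2: float, h21: float,
                  var_v2: float, cov_x1_v2: float = 0.0) -> float:
    """Return h such that E[X2^2] = P2 for X2 = h * (beta * Y2 + (1 - beta) * V2),
    where Y2 = h21 * X1 + Z2 (assumptions listed in the surrounding text)."""
    var_y2 = h21 ** 2 * P1 + N2
    cov_y2_v2 = h21 * cov_x1_v2  # Z2 is assumed independent of V2
    second_moment = (beta ** 2 * var_y2
                     + (1.0 - beta) ** 2 * var_v2
                     + 2.0 * beta * (1.0 - beta) * cov_y2_v2)
    if second_moment <= 0.0:
        raise ValueError("the combined signal must have positive power")
    return math.sqrt(P2 / second_moment)


# Pure instantaneous relaying (beta = 1) with P1 = P2 = 6, N2 = 1, h21 = 1.
print(ir_normalizer(1.0, 6.0, 6.0, 1.0, 1.0, var_v2=6.0))
```

For $\beta = 1$ the cognitive user simply scales and forwards its received symbol, and $h = \sqrt{P_2 / (h_{21}^2 P_1 + N_2)}$; for $\beta = 0$ with $V_2$ already at power $P_2$, no scaling is needed and $h = 1$.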

Finally, we consider the Gaussian CC-IFC with a block length 
delay ($L = n$). We can use $\mathcal{R}_1^*$ to obtain the Gaussian counterpart 
of $\mathcal{R}_3$, namely $\mathcal{R}_3^*$. The only difference is that, according to 
Theorem 3, there is no dependence on the previous block messages. 
Therefore, we can set $T_c = U_{1c}$ and $T_p = U_{1p}$, or equivalently 
$\beta'_2 = \beta_2 = 0$ in ($MAP_1$), to obtain $MAP_3$. 

Fig. 2 compares $\mathcal{R}_1^*$, $\mathcal{R}_2^*$ and $\mathcal{R}_3^*$ with the HK region in [3], where the 
overheard information is neglected. For $L = 0$, a rate improvement 
over the HK region can be seen, especially when the cognitive link is 
good enough ($h_{21} = 4$). Due to IR, even when $h_{21} = 1$, $\mathcal{R}_2^*$ 
outperforms both $\mathcal{R}_1^*$ and the HK region significantly. Considering 
$\mathcal{R}_2^*$ and $\mathcal{R}_3^*$, it is seen that when $R_2$ is small, IR can achieve 
higher rates than non-causal DF. However, when the cognitive user 
sends at higher rates, the condition of the cognitive link determines 
the better strategy. Note that, using a coding scheme based on the 
combination of IR and non-causal DF, the convex hull of $\mathcal{R}_2^*$ and $\mathcal{R}_3^*$ 
is achievable for the CC-IFC with $L = n$. 

Fig. 2. Comparison between $\mathcal{R}_1^*$, $\mathcal{R}_2^*$, $\mathcal{R}_3^*$ and the HK region [3]. $P_1 = P_2 = 6$, 
$h_{31} = h_{42} = 1$, $h_{32} = h_{41} = \sqrt{0.55}$ and $N_2 = N_3 = N_4 = 1$. 